SKESA: strategic k-mer extension for scrupulous assemblies
نویسندگان
چکیده
منابع مشابه
KAT: a K-mer analysis toolkit to quality control NGS datasets and genome assemblies
Motivation De novo assembly of whole genome shotgun (WGS) next-generation sequencing (NGS) data benefits from high-quality input with high coverage. However, in practice, determining the quality and quantity of useful reads quickly and in a reference-free manner is not trivial. Gaining a better understanding of the WGS data, and how that data is utilized by assemblers, provides useful insights ...
متن کاملStatistics for K-mer Based Splicing Analysis
It is well acknowledged that alternative splicing module plays a crucial role to identify the variations of the RNA transcriptomes. In high-throughput short-read RNA, splicing analysis is a challenging task due to the uncertainty and time complexity of reads alignments onto genome and transcriptome. In this paper, we introduce k-mer based statistical method for splicing event analysis. The k-me...
متن کاملThe Minimal k-Core Problem for Modeling k-Assemblies
The concept of cell assembly was introduced by Hebb and formalized mathematically by Palm in the framework of graph theory. In the study of associative memory, a cell assembly is a group of neurons that are strongly connected and represent a "concept" of our knowledge. This group is wired in a specific manner such that only a fraction of its neurons will excite the entire assembly. We link the ...
متن کاملCompact Universal k-mer Hitting Sets
We address the problem of finding a minimum-size set of k-mers that hits L-long sequences. The problem arises in the design of compact hash functions and other data structures for efficient handling of large sequencing datasets. We prove that the problem of hitting a given set of L-long sequences is NP-hard and give a heuristic solution that finds a compact universal k-mer set that hits any set...
متن کاملBacterial population assay via k-mer analysis
Identifying and assaying the relative abundance of members of complex microbial communities is an important problem in ecology. Sandberg et al. investigated the usage of genomic signatures to provide high identification percentages from short sequence samples. In this paper we present an improved naive Bayesian classification method using conditional probabilities, which can be used to classify...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Genome Biology
سال: 2018
ISSN: 1474-760X
DOI: 10.1186/s13059-018-1540-z